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Amendments to the Claims: 

This listing of claims will replace all prior versions, and listings, of claims in the 
application: 

Listing of Claims: 

1 . ( Currently amended) A method for processing data representing documents, 
comprising: 

for individual documents of a set of documents, executing a software 
program to obtain a list of salient terms found in each document; 

comparing the list of salient terms for a first document to the list of salient 
terms for a second document; and 

declaring the first document to be substantially identical to, or 
substantially similar to, the second document if some predetermined number of 
salient terms are found in each of the lists of the first document and the second 
document, 

wherein the comparing includes sorting the documents into a document 
list in order of increasing size, and where the step of comparing compares the list 
of salient terms for a given document with the list of salient terms for the next 
larger-documents in the document list. 

2. (Currently Amended) A method as in claim 1, wherein if the predetermined number is 
about 90% of the salient terms or greater the first document is declared to be substantially 
identical to the second document. 

3. (Original) A method as in claim 1, wherein the set of documents is obtained in 
response to a search query made to a data communications network. 

4. (Previously presented) A method as in claim 1, and further comprising storing the 
lists of salient terms in a database. 
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5. (Original) A method as in claim 1, and further comprising computing a signature for 
each document, and storing the computed document signature. 

6. (Previously presented) A method as in claim 1, and further comprising computing a 
signature for each document, and storing the computed document signature in association 
with the list of salient terms for each document. 

7. (Previously presented) A method as in claim 1, wherein the step of executing a 
software program assigns to each salient term a collection-level importance ranking or 
Information Quotient (IQ), and wherein the IQ is considered during the step of 
comparing. 

8. (Currently Amended) A method for processing data representing documents, 
comprising: 

for individual documents of a set of documents, executing a software 
program to obtain a list of terms found in each document; 

comparing the list of terms for a first document to the list of terms for a 
second document; 

declaring the first document to be substantially identical to, or 
substantially similar to, the second document if some predetermined number of 
terms are found in each of the lists of the first document and the second 
document; and 

wherein the step of comparing includes a preliminary step of sorting the 
documents into a document list in order of increasing size, and where the step of 
comparing compares the list of terms for a given document with the list of terms for the 
for the next larger-documents in the document list. 

9. (Currently Amended) A method for processing data representing documents, 

comprising: 

for individual documents of a set of documents, executing a software 
program to obtain a list of terms found in each document; 
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comparing the list of terms for a first document to the list of terms for a 
second document; 

declaring the first document to be substantially identical to, or 
substantially similar to, the second document if some predetermined number of 
terms are found in each of the lists of the first document and the second 
document; and 

wherein the step of comparing includes a preliminary step of sorting the 
documents into a document list in order of increasing size, and where the step of 
comparing compares the list of terms of a given document only with the list of terms 
of another document in the list that is no more than a predetermined amount larger 
than the given document. 

10. (Currently amended) A method for processing data representing documents, 
comprising: 

for individual ones of documents, executing a software program to obtain 
a list of salient terms found in each document; 

computing a document signature for each document from the list of salient 
terms obtained for the document; 

comparing the document signature for a first document to the document 
signature for a second document; and 

declaring the first document to be substantially identical to the second 
document if the document signatures are substantially equal, 

wherein the step of comparing includes a preliminary step of sorting the 
documents into a document list in order of increasing size, and where the step of 
comparing compares the list of salient terms of a given document with the list of 
salient terms of the next larger-documents in the document list. 

1 1 . (Previously presented) A method as in claim 10, wherein the step of computing a 
document signature computes 
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a hash code for each term of the list of salient terms, and then sums all of the hash codes 
to form the document signature. 

12. (Original) A method as in claim 10, wherein the documents are obtained in 
response to a search query made to a data communications network, and where the steps 
of comparing and declaring are executed in substantially real time as the documents are 
returned by the query. 

13. (Original) A method as in claim 10, wherein the documents are obtained in response 
to a search query made to a data communications network, where the steps of comparing 
and declaring are executed in substantially real time as the documents are received from 
the data communications network, and for a case where a received document is found to 
be substantially identical to an already received document, returning only one of the 
documents in response to the search query. 

14. (Original) A method as in claim 10, and further comprising storing the computed 
document signatures in a database. 

15. (Previously presented) A method as in claim 10, and further comprising storing the 
computed document signature in association with the list of salient terms for each 
documents 

16. (Currently amended) A system for processing data representing documents 
comprising, for individual documents of a set of documents, a processor for e x e cuting 
that executes a software program to obtain a list of salient terms found in each document 
and for comparing that compares the list of salient terms for a first document to the list of 
salient terms for a second document, said processor b e ing op e rable for d e claring ^m 
response to executing the software program, declaring the first document to be 
substantially identical to, or substantially similar to, the second document if some 
predetermined number of salient terms are found in each of the lists of the first document 
and the second document, wherein said comparing includes sorting the documents into a 
document list in order of increasing size, and where said comparing compares the list of 
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salient terms for a given document with the the list of salient terms for the next larger- 
documents in the document list. 

17. (Currently Amended) A system as in claim 16, wherein if the predetermined number 
is about 90% of the salient terms or greater the first document is declared to be 
substantially identical to the second document. 

18. (Original) A system as in claim 16, wherein the set of documents is obtained in 
response to a search query made to a data communications network. 

19. (Previously presented) A system as in claim 16, and further comprising a memory 
containing a database for storing the lists of salient terms. 

20. (Original) A system as in claim 16, said processor being further operable for 
computing a signature for each document, and further comprising a memory for storing 
the computed document signature. 

21 . (Previously presented) A system as in claim 16, said processor being further operable 
for computing a signature for each document, and further comprising a memory for 
storing the computed document signature in association with the list of salient terms for 
each document. 

22. (Previously presented) A system as in claim 16, wherein said processor, when 
executing the software program, assigns to each term a collection-level importance 
ranking or Information Quotient (IQ), and wherein the IQ is considered by said processor 
when comparing the list of salient terms for the first document to the list of salient terms 
for the second document. 

23. (Currently Amended) A system for processing data representing documents 

comprising, 
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for individual documents of a set of documents, a processor for e x e cuting that executes a 
software program to obtain a list of terms found in each document and for comparing 
compares the list of terms for a first document to the list of terms for a second document, 
said processor being operable for d e claring operating to declare the first document to be 
substantially identical to, or substantially similar to, the second document if some 
predetermined number of terms are found in each of the lists of the first document and the 
second document; and wherein said processor is further operable, before comparing the 
lists of terms, to sort the documents into a document list in order of increasing size, and 
to then compare the list of terms of a given document with the list of terms of a next 
larger documents in the document list. 

24. (Currently Amended) A system for processing data representing documents 

comprising, 

for individual documents of a set of documents, a processor for ex e cuting that executes a 
software program to obtain a list of terms found in each document and for comparing 
compares the list of terms for a first document to the list of terms for a second document, 
said processor being op e rabl e for declaring 3 in response to executing the software 
program, declaring the first document to be substantially identical to, or substantially 
similar to, the second document if some predetermined number of terms are found in 
each of the lists of the first document and the second document; and wherein said 
processor is further operable , before comparing the lists of terms, te sorts the documents 
into a document list in order of increasing size, and to th e n compares the list of terms for 
a given document only with the list of terms for another document in the list that is no 
more than a predetermined amount larger than the given document. 

25. (Currently amended) A system for processing data representing documents, 
comprising, for individual documents of a set of documents, a processor for ex e cuting 
that executes a software program to obtain a list of salient terms found in each document, 
for computing computes a document signature for each document from the list of salient 
terms obtained for the document; for comparing compares the document signature for a 
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first document to the document signature for a second document; and for declaring 
declares the first document to be substantially identical to the second document if the 
document signatures are equal, wherein said comparing includes sorting the documents 
into a document list in order of increasing size, and where said comparing compares the 
list of salient terms for a given document with the list of salient terms for the next larger- 
documents in the document list. 

26. (Previously presented) A system as in claim 25, wherein said processor computes the 
document signature by computing a hash code for each salient term of the list of salient 
terms, and summing all of the hash codes to form the document signature. 

27. (Original) A system as in claim 25, wherein the documents are obtained in response 
to a search query made to a data communications network, and where the processor 
executes the comparing and declaring functions in substantially real time as the 
documents are returned by the query. 

28. (Original) A system as in claim 25, wherein the documents are obtained in 
response to a search query made to a data communications network, where the processor 
executes comparing and declaring functions in substantially real time as the documents 
are received from the data communications network, and for a case where a received 
document is found to be substantially identical to an already received document, said 
processor returns only one of the documents in response to the search query. 

29. (Original) A system as in claim 25, and further comprising a memory containing a 
database for storing the computed document signatures. 

30. (Original) A system as in claim 29, and further comprising storing the computed 
document signature in association with the list of terms for each document. 

31. (Currently amended) A computer program recorded on a computer-readable media, 
said computer program comprising instructions for directing a data processor to process 
data representing documents by, for individual documents of a set of documents, 
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obtaining a list of salient terms found in each document; comparing the list of salient 
terms for a first document to the list of salient terms for a second document; and 
declaring the first document to be substantially identical to, or substantially similar to, the 
second document if some predetermined number of salient terms are found in each of the 
lists of the first document and the second document, wherein said comparing includes 
sorting the documents into a document list in order of increasing size, and where said 
comparing compares the list of salient terms for a given document with the list of salient 
terms for the next larger-documents in the document list.. 

32. (Currently amended) A computer program recorded on a computer-readable media, 
said computer program comprising instructions for directing a data processor to process 
data representing documents by, for individual ones of documents, obtaining a list of 
salient terms found in each document; computing a document signature for each 
document from the list of salient terms obtained for the document; comparing the 
document signature for a first document to the document signature for a second 
document; and declaring the first document to be substantially identical to the second 
document if the document signatures are equal, wherein said comparing includes sorting 
the documents into a document list in order of increasing size, and where said comparing 
compares the list of salient terms for a given document with the list of salient terms for 
the next larger-documents in the document list. 

33. (Previously presented) A computer program as in claim 32, wherein the document 
signature is computed by computing a hash code for each salient term of the list of salient 
terms, and summing together all of the hash codes to form the document signature. 
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